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Docket No.: 49959-039 Patent 

HYBRID INVARIANT ADAPTIVE 
AUTOMATIC DEFECT 
CLASSIFICATION 

FIELD OF THE INVENTION 

The present invention relates to a method and apparatus for 
automatically classifying defects on the surface of an article. The invention 
has particular applicability for in-line inspection of semiconductor wafers 
during manufacture of high density semiconductor devices with submicron 
design features. 

BACKGROUND ART 

Current demands for high density and performance associated with 
ultra large scale integration require submicron features, increased transistor 
and circuit speeds and improved reliability. Such demands require formation 
of device features with high precision and uniformity, which in turn 
necessitates careful process monitoring, including frequent and detailed 
inspections of the devices while they are still in the form of semiconductor 
wafers. 

Conventional in-process monitoring techniques employ an "inspection 
and review" procedure wherein the surface of the wafer is initially scanned by 
a high-speed, relatively low-resolution inspection tool; for example, an opto- 
electric converter such as a CCD (charge-coupled device) or a laser. 
Statistical methods are then employed to produce a defect map showing 
suspected locations on the wafer having a high probability of a defect. If the 
number and/or density of the potential defects reaches a predetermined level, 
an alarm is sounded, indicating that a more detailed look at the potential defect 
sites is warranted. This technique is known as "total density monitoring" of 
defects and produces a statistic called the "total defect density". 



When the defect density reaches a predetermined level, a review of the 
affected wafers is warranted. After a redetection procedure is carried out, 
using the defect map, to positively determine the presence of defects, a more 
detailed review procedure is carried out on the individual defect sites, such as 
5 scanning with a CCD to produce a relatively high-resolution image. The 
defect image is then analyzed to determine the nature of the defect (e.g., a 
defective pattern, a particle, or a scratch). 

Since it has recently been recognized that monitoring "classified defect 
density", i.e., the number of defects of several different types, or "classes" of 
10 defects, is preferable to monitoring total defect density, various methods for 
classification of defects have been introduced. Most of these conventional 
methods, called "classic classifiers" herein, employ pattern recognition 
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-S techniques wherein a set of sample defects is acquired, imaged and analyzed 

for particular characteristics or "predicates" (e.g., brightness, roughness, size, 

: f? 15 color), which predicates are fed into a "black box" (e.g., a neural net) and used 

Q 

: £ to train the classifier to recognize different types of defects by the defects' 
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7" predicates. 

2 Disadvantageously, the efficiency of these methods is reduced because 

j«* there is no agreed-upon set of defect classes. Different semiconductor 

20 fabricators consider different defects to be important and, therefore, use 
different sets of defect classes tailored to their specific needs. Thus, they 
require many examples of defect images to be obtained for each defect class 
prior to becoming operational. Consequently, typical prior art systems cannot 
be used during start-up and ramp-up of a production line. Furthermore, 
25 because such classifiers, also referred to as "full classifiers" herein, need to 
discriminate between all defect types required to be classified (e.g., 10 or more 
classes of defects), a large number of predicates must be considered when 
classifying any defect, thus increasing inspection time and reducing 
production throughput. 
30 To address the above-mentioned problems associated with full 

classifiers, an invariant core classifier ("core classifier" herein) has recently 
been introduced in the defect review system marketed as the SEMVision™, 
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available from Applied Materials of Santa Clara, California. Such a core 
classifier is described in copending U.S. Patent Application Serial No. 
09/1 1 1,454, filed July 8, 1998, entitled "Automatic Defect Classification With 
Invariant Core Classes", the entire disclosure of which is hereby incorporated 
by reference. 

According to the methodology of the copending application, after a 
defect map of a semiconductor wafer has been generated, each defect site and 
a corresponding known non-defective reference site is imaged by a scanning 
electron microscope (SEM) to gather and store location and topographical 
data. The image is then analyzed, as by performing boundary analyses and/or 
topographical measurements, to classify the defect as being in one of a number 
(e.g., seven) of invariant core classes of defect, and further classified as being 
in one of an arbitrary number of core sub-classes as desired by the user by 
adding pre-programmed modules onto the core classifier. 

Fig. 1 is a conceptual flow chart of automatic defect classification into 
core classes performed by the methodology of the copending application. A 
defect 1 is classified broadly as a pattern defect 2A or a particle defect 2B, and 
further placed into one of seven exemplary invariant core classes of defects: 
craters and microscratches on the wafer surface 3A, a missing pattern on the 
surface 3B, an extra pattern on the surface 3C, a deformed pattern on the 
surface 3D, a particle on the surface 3E, a particle embedded in the surface 3F, 
or a particle and a deformed pattern on the surface 3G. Arbitrary core 
subclasses may include bridging (i.e., short circuiting) between neighboring 
wiring patterns, a small particle, a large particle, a broken line, a narrow 
pattern, etc. 

The invariant core classification technique of the copending 
application enables defects to be separately and reliably classified as particle 
or pattern defects, and as on-surface or below-surface (embedded) defects. It 
also provides early quantification and notification of these meaningfully 
classified defects, thereby facilitating investigation of the causes of the 
defects, and enabling early corrective action to be implemented. 



The core classifier of the copending application is a "rule-based" 
classifier in that it classifies defects by collecting defect information (i.e., 
imaging the wafer surface and performing boundary analysis and/or 
topographical measurement of its features) then following a set of rules 
programmed a priori (i.e., beforehand). Thus, it does not need to be trained, 
as do classic classifiers, and so does not require examples of defect images for 
each class prior to being operational. Therefore, unlike prior art defect 
classification systems, the core classifier of the copending application can be 
used during the start-up and ramp-up of a production line. 

While core classifiers as described in the copending application 
address many of the shortcomings of conventional classic classifiers, core 
classifiers may not be suitable for separating defects into every class deemed 
important by a user since, as rule-based classifiers, they cannot be easily 
adapted to recognize new classes of defects. Specifically, the user may require 
refinements within the invariant core classes (since different process lines may 
be sensitive to different defects from one to another) other than the core 
subclasses discussed above available as pre-programmed modules to be added 
to the core classifier. Furthermore, the user may require refinements that 
cannot be discerned by the core classifier. For example, if the core classifier 
classifies a defect as a particle on the surface (core class 3F in Fig. 1), and the 
user wishes to know the shape of the particle in combination with its size, 
another technique must be used to obtain this size information, which is 
helpful in pinpointing the source of the particle, since different processes tend 
to produce different particle shapes and sizes. Additionally, "exotic" defects 
that do not fall into any of the core classes cannot be classified by a core 
classifier. For example, if a process is introduced that results in a new type of 
defect, the existing core classes will be irrelevant in relation to the new defect, 
and the core classifier will not be able to classify the new defect unless the 
new defect is added as a core class. 

There exists a need to quickly and meaningfully review semiconductor 
wafers and automatically classify defects using a core classifier, then further 
classify the defects into subclasses within a core class desired by the user in 



order to identify processes causing defects, thereby enabling early corrective 
action to be taken. This need is becoming more critical as the density of 
surface features, die sizes, and number of layers in devices increase, requiring 
the number of defects to be drastically reduced to attain an acceptable 
manufacturing yield. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a method and apparatus 
for automatic, fast, reliable and complete classification of defects in 
semiconductor wafers. 

According to the present invention, the foregoing and other objects are 
achieved in part by a method of classifying a defect on the surface of an 
article, which method comprises imaging the surface, classifying the defect as 
being in one of a predetermined number of core classes of defects using a core 
classifier, and classifying the defect as being in one of an arbitrary number of 
variant subclasses using a specific adaptive classifier associated with the one 
core class. If the core classifier cannot classify the defect into a core class, a 
full classifier is used to classify the defect. 

A further aspect of the present invention is an apparatus for carrying 
out the steps of the above method. 

A still further aspect of the present invention is a computer-readable 
medium bearing instructions for automatically classifying a defect on the 
surface of an article, the instructions, when executed, being arranged to cause 
one or more processors to perform the steps of the above method. 

Additional objects and advantages of the present invention will 
become readily apparent to those skilled in this art from the following detailed 
description, wherein only the preferred embodiment of the invention is shown 
and described, simply by way of illustration of the best mode contemplated for 
carrying out the invention. As will be realized, the invention is capable of 
other and different embodiments, and its several details are capable of 
modifications in various obvious respects, all without departing from the 



invention. Accordingly, the drawings and description are to be regarded as 
illustrative in nature, and not as restrictive. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Reference is made to the attached drawings, wherein elements having 
the same reference numeral designations represent like elements throughout, 
and wherein: 

Figure 1 is a conceptual flow chart of defect classification according to 
the copending invention. 

Figure 2 is a conceptual flow chart of defect classification according to 
the present invention. 

Figure 3 illustrates a semiconductor wafer to be inspected using the 
present invention. 

Figures 4A-4C are representations of images of a defect to be 
inspected by the present invention. 

Figure 5 is a block diagram of a review tool according to the present 
invention. 

Figure 6 is a flow chart illustrating sequential steps of a method 
according to the present invention. 

Figure 7 is a function block diagram of an inspection tool according to 
the present invention. 

Figure 8 is a block diagram that illustrates an embodiment of the 
invention. 

DESCRIPTION OF THE INVENTION 

Core classifiers, such as described in copending U.S. Application 
09/111,454, enable defects to be quickly and reliably classified as particle or 
pattern defects, and as on-surface or below-surface (embedded) defects, and 
are operable during start-up and ramp-up of a production line. However, core 
classifiers cannot classify certain types of defects within its core classes, and 
are not easily modified by the user to add defect classes. Conventional classic 
classifiers, on the other hand, are fully adaptable to classify any type of defect 



as trained by the user, but require many exa mples of defect images to b e 
obtained for each defect class prior to becomin gjo perational . Thus, classic 
classifiers cannot be used du ring sta rt- up and ramp-up o f a production line. 
Furthermore, classic-style full classifie rs are slower than core classifiers, and 
5 generally less accurate. The present invention addresses and solves these 
problems by providing a hybrid defect classification methodology utilizing a 
core classifier together with limited-range classic-style classifiers for fast and 
reliable classification of defects into invariant core classes and variant 
subclasses as needed by the user. Thus, the present methodology enables 
10 ready identification of processes causing defects, thereby enabling early 
corrective action to be taken. 

According to certain embodiments of the methodology of the present 
Q invention, after a defect map of a semiconductor wafer has been generated, 

V? each defect site and a corresponding known non-defective reference site is 

ff 15 imaged by a scanning electron microscope (SEM) to gather and store location 

C and topographical data. This_data_is then-analyzed_fe y a core classifier usin g 

boundary an alysis and topograp hical techniques to clas sifiMthe d efect as being 
in one of a numbe r of predetermined core classesjofdefect (the core class may 
be one of a number of invariant core classes or a variant class programmed 
20 into the core classifier as desired by the user). The defect is then further 
S classified into one of an arbitrary number of variable subclasses (i.e., 

subclasses that cannot be discerned by the core classifier) using a "specific 
adaptive classifier" associated with the core class in which the defect has been 
placed, which employs classic classifier methodology on a limited scale, for 
25 efficiency. If the core classifier or specific adaptive classifier cannot classify 
the defect into a core class, the defect is analyzed by a conventional full 
classifier. 

The specific adaptive classifier of the present method^ogy^operates 
( rbased^njh^sgne princi ples as a clas sic^class^fier, in that it is trained by the 
30 user with a set of sample defect images to classify a defect based on the 
defect's predicates. However, unlike conventional full classifiers that must 
classify every defect as being in one of many classes (e.g., 10 classes), the 
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specific adaptive classifier analyzes only defects of the core class or classes 
with which it is associated (e.g., 1, 2 or 3 classes). Thus, the specific adaptive 
classifier is not trained to consider that the defect may be in one of many 
classes; rather, the specific adaptive classifier is trained to start its analysis of 
5 the defect "knowing" that the-defect is in a particular core class. For example, 
a specific adaptive classifier associated with the "particle on surface" core 
class 3E in Fig. 1 is able to quickly and accurately determine the attributes of a 
particle on the surface as required by the user, because it does not need to first 
determine that the defect is a particle rather than a pattern defect (such as 3B- 
10 3D) or a scratch (3 A). In other words, the specific adaptive classifier is 
presented with a "sub-problem" (e.g., a defect that it knows is in one of, say, 2 
or 3 classes) that is easier to solve than the total problem (a defect that could 

ijj be in one of 1 0 or more classes). This limit on the domain of the specific 

i-tl 

;~f £ adaptive classifier enables it to classify defects more reliably and faster than a 

■5 15 conventional classic classifier, since the specific adaptive classifier needs to 

5 consider less predicates to reach its decision in most situations. That is, 

^ because the specific adaptive classifier knows what class the defect is in, it 

3 will consider a particular set of predicates first and not consider predicates 

1^ irrelevant to the core class with which it is associated. 

:< y 20 The present invention utilizes the best attributes of three different 

tfj types of classifiers to perform defect classification more quickly and reliably 

than prior art methodologies. It employs a core classifier for fast initial 
classification of defects into a predetermined number of core classes, the core 
classifier being able to work during start-up and ramp-up of a production line 
25 because it does not require examples of defects. It also uses specific adaptive 
classifiers in parallel with the core classes for local refinement within a core 
class. The present invention further employs a full classifier, but only on a 
limited basis for special types of defects that do not fit in to the core 
classification scheme. Thus, as the fabrication process matures and the types 
30 of defects of interest become more diverse, the specific adaptive and full 
classifiers can be trained as needed to perform more detailed defect 
classification. 
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Fig. 2 is a conceptual flow chart of automatic defect classification 
performed by the methodology of the present invention. A defect 1 is 
classified broadly by a core classifier, as discussed above, as a pattern defect 
2A or a particle defect 2B, and further placed into one of seven exemplary 
5 invariant core classes of defects 3A-3G. The core classifier can further place 
the defects in core subclasses (not shown) including bridging (i.e., short 
circuiting) between neighboring wiring patterns, a small particle, a large 
particle, a broken line, a narrow pattern, etc. as desired by the user by adding 
pre-programmed "modules" onto the core classifier. Defects in each of the 
10 core classes 3A-3G are then classified into an arbitrary number of variable 
subclasses SC 4A1-4G2 as desired by the user by specific adaptive classifiers 
trained by the user. Alternatively, defects can be classified by a specific 
adaptive classifier into a subclass such as 4H directly after being broadly 
classified as a particle or pattern defect, as shown by the dashed line in Fig. 2. 
15 The decision as to how far the core classifier classifies a defect before the 
specific adaptive classifiers are used is based on the experience of the user in 
an effort to obtain the best classification results. Defects that cannot be 
classified by the core classifier or the specific adaptive classifiers are sent to a 
full classifier trained by the user that classifies them into an appropriate class 



An embodiment of the present invention is illustrated in Figs. 3-6. As 
shown in Fig. 3, a semiconductor wafer W to be inspected for defects has a 
plurality of patterned integrated circuit dies 1000. Initially, a defect map is 
produced by conventional techniques, such as by scanning the surface of a 

25 wafer with a high-speed inspection tool (a CCD, a laser or an SEM may be 
employed for this purpose), then using statistical methods, typically involving 
algorithms and/or grey-scale analysis, to identify suspected locations on the 
wafer having a high probability of having a defect. 

Next, as shown in Figs. 4A-4C, a redetection procedure is carried out 

30 at each suspected defect location to determine the exact location of the defect. 
A conventional CCD scanner or an SEM may be used to image a pattern 10 at 
a suspected defect location, which is then compared to a reference pattern 20 



4 



10 



at a corresponding location on an adjacent or other die on the same wafer 
which is not suspected of having a defect. If a difference 30 is found between 
the suspected defective pattern 10 and the reference pattern 20, the suspected 
defective pattern 10 is determined to be a defect, and the inventive analysis 
5 and classification commences. 

The present invention is implemented at a review tool 500, as shown in 
Fig. 5, comprising an imager 510, preferably a high resolution SEM such as 
the SEMVision™ available from Applied Materials, Santa Clara, California, 
or as described in the copending application. Review tool 500 further 
10 comprises a processor 520, which preferably performs the analysis disclosed 
herein electronically, and a monitor 530 for displaying results of the analyses 
of processor 520. Processor 520 can be in communication with a conventional 
wafer reference design database 550 and a memory device 540, such as a 
semiconductor memory. 
15 Fig. 6 is a flow chart of an embodiment of the inventive methodology, 

and Fig. 7 is a functional block diagram of processor 520. At step 600, the 
defect is imaged by imager 510 as discussed above. Next, the defect image is 
analyzed in step 610 by a core classifier 700 as disclosed in the copending 
application, by performing a boundary analysis of the image and/or a 
20 topographical analysis of the image. If it is determined at step 620 that core 
=0 classifier 700 cannot classify the defect into a predetermined core class (e.g., 

classes 2A-B or 3A-3G in Fig. 2), the defect image is analyzed by full 
classifier 710 at step 630. 

If core classifier 700 classifies the defect, it is determined at step 640 
25 whether further classification into a subclass of a core class is required. If not, 
the classification procedure ends. If further classification into a subclass is 
needed, the defect image is further analyzed by one of specific adaptive 
classifiers 720a-720e associated with the core class 2A, 2B or 3A-3G into 
which the defect was classified by core classifier 700 (see step 650). For 
30 example, if core classifier 700 classifies the defect as a particle on the surface 
(i.e., core class 3F) or a particle embedded in the surface (core class 3E) and 
the user wishes to know the shape in combination with the size of the particle, 
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one of the specific adaptive classifiers 720a-e, that has been trained by the user 
to determine particle shape/size if it is given a known particle defect, then 
further analyzes the defect image. If the appropriate specific adaptive 
classifier 720a-720e successfully classifies the defect as desired, the 
5 classification procedure ends at step 660. If not, the defect is further analyzed 
by full classifier 710 at step 630. 

To achieve the advantages of reduced inspection time and increased 
classification reliability of the present methodology, specific adaptive 
classifiers 720a-e are each associated with (i.e., analyze known defects from) a 
10 limited number of core classes 2 A, 2B, 3A-3G, e.g., two or three core classes. 
Each specific adaptive classifier 720a-e is trained to identify defects of a 
subclass by providing the specific adaptive classifier a set of predicates (i.e., 
feature descriptors) of a number of examples of known defects of the subclass. 
In operation, the specific adaptive classifier analyzes the same predicates of a 



15 defect to be classified to determine if the defect falls in the subclass. 



j£ Because specific adaptive classifiers 720a-e only classify defects of a 

l ^ small number of core classes, each can be trained to expect only defects from 

Q those classes; in other words, each specific adaptive classifier can be trained to 

begin its analysis by looking at a particular set of predicates of the defect 
!Jj 20 image. This list of predicates is generally much shorter than the list of 

Q predicates a conventional full classifier must consider as it begins its analysis 

of an unclassified defect. Specific adaptive classifi grs typically consider only _ 
about 4-7 predicates, while conventional full^classifiers typ icall y utilize 40- 
100 predicates. For example, a specific adaptive classifier used to further 
25 classify a defect determined by core classifier 700 to be a particle defect 
would have been trained to first consider predicates such as size, brightness, 
roughness and roundness of the p article. In contrast, a specific adaptive 
classifier for further classifying a defect determined by core classifier 700 to 
be a pattern defect would typically be trained to first consider predicates such 
30 as pattern size, distance from other patterns, area of the pattern, etc.; but not 
brightness, roundness and roughness, since these predicates are not necessarily 
relevant to pattern defects. Additionally, a specific adaptive classifier for 
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further classifying defects determined to be scratches by core classifier 700 
would typically be trained to consider yet another different set of predicates, 
such as scatter, elongation and non-circularity. Thus, by being trained to look 
for a relatively narrow range of defects, specific adaptive classifiers 720a-e 
enable further classification of defects after core classification with a 
minimum of inspection time. 

Specific adaptive classifiers 720a-e are typically configured to be 
trained using^well -known classic classifier techni ques^s uch as ji ^decision 
tree" tech nique or a "m ultidimensional clustering" technique. The decision 
tree technique typically comprises the cl assifier analyz ing_a_first_ predicate of 
the defect image, then using the value of th e first predicate to dgte nnine jhe 
next predi cate to be analyzed, and so on until the defect is clas sified. In the 
multidimensional clustering technique, the classifier is trained to consider a 
number of predicates "x" of the known sample defects, each sample defect 
being treated as a point having x coordinates hanging in x-dimensional space. 
The classifier executes algorithms to plot the points and determine whether 
they are clustered together. If so, the classifier connects the points (e.g., with 
an ellipsoid) and deems that every point inside the ellipsoid represents the 
same kind of defect as the sample defects. Of course, any technique used to 
construct a classic classifier can be used to construct a specific adaptive 
classifier. 

Moreover, a spec ific adaptive classi fier can be given a set of rules to 
follow to classify defect s, rather than being trained, or can op erate using a 
combina tion of rule-based and learned (i.e., trained) classification techniques. 
For example, if the core classifier classifies a defect as a particle, and the 
specific adaptive classifier is to determine the size of the particle, the specific 
adaptive classifier can be programmed to "measure" the particle's diameter 
using information gathered by the core classifier and then follow an algorithm 
to classify the particle as a "large particle" (e.g., above xjam diameter), "small 
particle" (e.g., below x|im diameter), etc. as determined by the user. 
Alternatively, the specific adaptive classifier can be trained to analyze particle 
size by giving the specific adaptive classifier a number of examples of particle 
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defects, using a multidimensional clustering technique to plot the particles 
according to their size, and attempting to cluster them into a number of size 
categories (e.g., two, three, or four separate categories) such as "small", 
"medium", "large", etc. as determined to be significant by the classifier. If the 
5 specific adaptive classifier cannot cluster the defects; e.g., if they are found to 
be evenly or randomly distributed rather than clustered, then the user will need 
to design a set of rules for classifying the particles by size, as by using an 
algorithm as discussed above. 

Figure 8 is a block diagram that illustrates an embodiment of the 
10 invention shown in Fig. 5. According to this embodiment, processor 520, as 
shown in Fig. 5, includes a bus 802 or other communication mechanism for 
_ communicating information, and a central processing unit (CPU) 804 coupled 

JO with bus 802 for processing information. Processor 520 also includes a main 

memory 806, such as a random access memory (RAM) or other dynamic 
1 5 storage device, coupled to bus 802 for storing information and instructions to 
be executed by CPU 804. Main memory 806 also may be used for storing 
7* temporary variables or other intermediate information during execution of 

O instructions to be executed by CPU 804. Processor 520 further includes a read 

en 

M. only memory (ROM) 808 or other static storage device coupled to bus 802 for 

! ^ 20 storing static information and instructions for CPU 804. A storage device 810, 

such as a magnetic disk or optical disk, is provided and coupled to bus 802 for 
storing information and instructions. Storage device 810 may also serve as 
memory 540 in Fig. 5. 

Processor 520 may be coupled via bus 802 to monitor 530, such as a 
25 cathode ray tube (CRT), for displaying information to a computer user. An 
input device 814, including alphanumeric and other keys, is coupled to bus 
802 for communicating information and command selections to CPU 804. 
Another type of user input device is cursor control 816, such as a mouse, a 
trackball, or cursor direction keys for communicating direction information 
30 and command selections to CPU 804 and for controlling cursor movement on 
monitor 330. 
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Imager 510 (Fig. 3) inputs data representative of images of a 
semiconductor wafer under inspection, as discussed above, to bus 802. Such 
data may be stored in main memory 806 and/or storage device 540, and used 
by CPU 804 as it executes instructions. Imager 510 may also receive 
5 instructions via bus 802 from CPU 804. 

Likewise, database 550 (Fig. 3) inputs data representative of a 
substantially defectless wafer, as discussed above, to bus 802. Such data may 
be stored in main memory 806 and/or storage device 540, and used by CPU 
804 as it executes instructions. 

10 The invention is related to the use of processor 520 for inspecting the 

surface of a semiconductor wafer for defects. According to one embodiment 
of the invention, inspection of the surface of a semiconductor wafer, including 
classification of surface defects, is provided by processor 520 in response to 
CPU 804 executing one or more sequences of one or more instructions 

15 contained in main memory 806. Such instructions may be read into main 
memory 806 from another computer-readable medium, such as storage device 
810. Execution of the sequences of instructions contained in main memory 
806 causes processor 804 to perform the process steps described above. One 
or more processors in a multi-processing arrangement may also be employed 

20 to execute the sequences of instructions contained in main memory 806. In 
alternative embodiments, hard-wired circuitry may be used in place of or in 
combination with software instructions to implement the invention. Thus, 
embodiments of the invention are not limited to any specific combination of 
hardware circuitry and software. The programming of the apparatus is readily 

25 accomplished by one of ordinary skill in the art provided with the flow chart 
of Fig. 6. 

The term "computer-readable medium" as used herein refers to any 
medium that participates in providing instructions to CPU 804 for execution. 
Such a medium may take many forms, including but not limited to, non- 
30 volatile media, volatile media, and transmission media. Non-volatile media 
include, for example, optical or magnetic disks, such as storage device 810. 
Volatile media include dynamic memory, such as main memory 806. 



15 



Transmission media include coaxial cable, copper wire and fiber optics, 
including the wires that comprise bus 802. Transmission media can also take 
the form of acoustic or light waves, such as those generated during radio 
frequency (RF) and infrared (IR) data communications. Common forms of 
5 computer-readable media include, for example, a floppy disk, a flexible disk, 
hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD, any 
other optical medium, punch cards, paper tape, any other physical medium 
with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, 
any other memory chip or cartridge, or any other medium from which a 
1 0 computer can read. 

Various forms of computer-readable media may be involved in 
carrying out one or more sequences of one or more instructions to CPU 804 
;0 for execution. For example, the instructions may initially be bome on a 

UJ magnetic disk of a remote computer. The remote computer can load the 

X 15 instructions into its dynamic memory and send the instructions over a 

:: P telephone line using a modem. A modem local to processor 520 can receive 

the data on the telephone line and use an infrared transmitter to convert the 
!^[ data to an infrared signal. An infrared detector coupled to bus 802 can receive 

the data carried in the infrared signal and place the data on bus 802. Bus 802 
f« 20 carries the data to main memory 806, from which CPU 804 retrieves and 

^ executes the instructions. The instructions received by main memory 806 may 

optionally be stored on storage device 810 either before or after execution by 
CPU 804. 

The inventive semiconductor wafer inspection technique enables 
25 defects to be quickly and reliably classified into core classes such as particle 
or pattern defects, and further into subclasses of the core classes. Since the 
inventive methodology classifies defects into core classes by imaging the 
wafer surface and performing boundary analysis and/or topographical 
measurement of its features, it does not require examples of defect images for 
30 each class prior to being operational. Therefore, unlike prior art defect 
classification systems using classic classifiers alone, the present invention can 
be used during the start-up and ramp-up of a production line. Furthermore, the 
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present invention utilizes specific adaptive classifiers, tailored to expect 
defects from only a small number of core classes, to classify defects into 
variable subclasses as desired by the user, employing classic classifier 
techniques with reduced inspection time and increased accuracy. Thus, the 
present invention provides classification of defects upon start-up and ramp-up 
of a production line by using a core classifier, along with the flexibility of 
classic-style adaptive classifiers to allow the classification of varied subclasses 
of defects as the production line matures. 

The present invention is applicable to the inspection of any 
semiconductor wafer, and is especially useful for in-process inspection of 
semiconductor wafers during manufacture of high-density semiconductor 
devices with submicron design features. 

The present invention can be practiced by employing conventional 
materials, methodology and equipment. Accordingly, the details of such 
materials, equipment and methodology are not set forth herein in detail. In the 
previous descriptions, numerous specific details are set forth, such as specific 
materials, structures, chemicals, processes, etc., in order to provide a thorough 
understanding of the present invention. However, as one having ordinary skill 
in the art would recognize, the present invention can be practiced without 
resorting to the details specifically set forth. In other instances, well known 
processing structures have not been described in detail, in order not to 
unnecessarily obscure the present invention. 

Only the preferred embodiment of the invention and but a few 
examples of its versatility are shown and described in the present disclosure. 
It is to be understood that the invention is capable of use in various other 
combinations and environments and is capable of changes or modifications 
within the scope of the inventive concept as expressed herein. 



